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Abstract 

Adiabatic quantum optimization has attracted a lot of attention because small scale simulations 
gave hope that it would allow to solve NP-complete problems efficiently. Later, negative results 
proved the existence of specifically designed hard instances where adiabatic optimization requires 
exponential time. In spite of this, there was still hope that this would not happen for random 
instances of NP-complete problems. This is an important issue since random instances are a good 
model for hard instances that can not be solved by current classical solvers, for which an efficient 
quantum algorithm would therefore be desirable. Here, we will show that because of a phenomenon 
similar to Anderson localization, an exponentially small eigenvalue gap appears in the spectrum 
of the adiabatic Hamiltonian for large random instances, very close to the end of the algorithm. 
This implies that unfortunately, adiabatic quantum optimization also fails for these instances by 
getting stuck in a local minimum, unless the computation is exponentially long. 
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I. INTRODUCTION 



Adiabatic quantum computing is a computational paradigm (introduced in [T]) where the 
solution to an optimization problem is encoded in the ground state of some Hamiltonian 
Hp. An adiabatic algorithm would proceed as follows: prepare the ground state of another 
Hamiltonian Hq (chosen so that its ground state is easy to prepare), then slowly modify the 
Hamiltonian of the system from to ifp, using an interpolation H{s) = (1 — s)Hq + sHp. 
If this is done slowly enough, the Adiabatic Theorem of Quantum Mechanics 12] ensures that 
the system will stay close to the ground state of the instantaneous Hamiltonian throughout 
the evolution, so that we finally obtain a state close to the ground state of Hp. At this point, 
measuring the state will give us the solution of our problem with high probability. To put 
this in more quantitative terms, if the problem size (number of bits) is A^, then one requires 
the instantaneous eigenvalue gap A(s) between the ground state and the first excited state 
to be inverse polynomial in A^ at each step s of the evolution. The computation time T 
scales as the inverse square of the gap T ~ where A = min^ A(s). This means that 

if the eigenvalue gap becomes exponentially small at any point in the evolution, then the 
computation requires exponential time. This eigenvalue gap provides a possible advantage 
of adiabatic quantum computing compared to the usual model based on quantum circuits. 
Since the system stays in its ground state throughout the evolution, robustness against 
thermal noise and decoherence could be provided by the eigenvalue gap [31 IH E] • It was also 
shown that adiabatic quantum computing is universal for quantum computing i-e., any 
algorithm expressed as a quantum circuit may be translated into an adiabatic algorithm, 
and vice versa. 

Adiabatic quantum computing was first proposed as a new approach to solve hard op- 
timization problems, and has attracted a lot of attention because numerical evidence [7] 
seemed to indicate that the time required to solve NP-complete problems scaled only poly- 
nomially with the problem size, at least for small sizes. But later work gave strong evidence 
that this may not be the case. Refs. [HI E] show that adiabatic algorithms can fail if one 
does not choose the initial Hamiltonian carefully by taking into account the structure of the 
problem. In Ref. [10] the Hamiltonians of certain instances of 3-SAT were mapped to an 
Ising model and diagonalized analytically. It was shown that the gap is exponentially small 
in some cases. Refs. [TTj and [12] construct special instances of 3-SAT which are hard for the 
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adiabatic algorithm to solve. More recently, it was shown that very small gaps could appear 
in the spectrum of the Hamiltonian due to an avoided crossing between the ground state 
and another level corresponding to a local minimum of the optimization problem [Ti] . 
However, these results show the failure of adiabatic quantum optimization for specifically 
designed hard instances. In this paper, we show that adiabatic quantum optimization fails 
with high probability for randomly generated instances of the NP-complete problem Exact 
Cover 3 (EC3), also known as l-in-3 SAT. Since the core of the argument leading to this 
conclusion only relies on general properties shared by other NP-complete problems such 
as 3-SAT, this provides a strong evidence that adiabatic quantum optimization typically 
cannot solve hard instances of NP-complete problems efficiently. 

Our argument relies on different elements. In Section [IT], we introduce the problem Exact 



Cover 3 and the adiabatic algorithm proposed to solve it. In Section |III[ we study some 
statistical properties of random instances of EC3, which will be crucial to the result. In Sec- 



tion IV, we study the perturbation expansion of eigenenergies of the adiabatic Hamiltonian. 
In Section |V| we show that perturbation theory predicts an avoided crossing and therefore 
a small gap occurring close to the end of the adiabatic evolution, where perturbation theory 
becomes valid. We also performed numerical simulations to confirm the predictions of per- 



turbation theory and estimate the position of the avoided crossing. Finally, in Section VI 



we show that this avoided crossing induces an exponentially small gap. Since these results 



rely on perturbation expansions, we discuss in Section VII the applicability of perturbation 
theory. We show how this problem is intimately related to the phenomenon of Anderson lo- 
calization [15] , which implies that the eigenstates of the Hamiltonian are localized for small 
perturbation, corresponding in our case to the end of the algorithm, close to s = 1. An 
important observation is that the adiabatic Hamiltonian H{s) has exactly the same form as 
the model used by Anderson to describe localization, except that the particle evolves on an 
A^-dimensional hypercube instead of a d-dimensional lattice. This emphasizes the relevance 
of Anderson localization for the study of quantum algorithms, even though it is currently 
not commonly used in this context, one example being Ref. |16] where it has been used to 
show weaknesses in quantum walk algorithms. 
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II. PRELIMINARIES 

A. Exact Cover 

Exact Cover 3 (ECS) is an NP-complete problem which was considered for adiabatic algo- 
rithms in [7J. Consider an A^-bit string x = (a;i, X2, . . . ,xn), where Xi G {0, 1}. An instance 
of ECS consists of many clauses each containing three bits. A clause C = {xi^,Xj^,Xkc) is 
said to be satisfied if and only if one of the three bits is one and the other two are zero, i.e., 
Xi^ + Xj^ + Xk(y = 1. A solution is an assignment of the bits which satisfies all the clauses. 
The solutions of an instance of ECS can be encoded into a cost function given by 

/W = ^i^ic + ^jc + ^kc - 1)^- (1) 

c 

A solution is therefore an assignment which yields a zero cost. 

We will call random instances of ECS with bits and M clauses those generated by 
picking uniformly at random M clauses of three bits, with replacement. This distribu- 
tion of instances is important since it generates hard instances that can not be solved by 
current classical solvers. The hardness of random instances depends highly on the clauses- 
to- variables ratio a = ^. As a increases, we observe two phase transitions [T71I18]. For low 
a, the density of the solutions is high and essentially uniform. As a increases to the cluster- 
ing threshold Oc, the first phase transition occurs where the problem goes from having many 
solutions to clustered solutions with different clusters isolated from each other. The other 
phase transition occurs at the satisfiability threshold as after which the problem is unsatisfi- 
able with high probability. Therefore, the hard instances with only a few isolated solutions, 
which cannot be solved efficiently by known classical algorithms, lie just before this second 
phase transition, and this is the regime we will be interested in. The satisifiability threshold 
of ECS has been studied in ^19j, where it is shown that = 0.626S ± 10"*^. 

B. Adiabatic Algorithm for Exact Cover 

To design an adiabatic quantum algorithm for this problem, we build a problem Hamilto- 
nian Hp acting on a space of qubits such that each state |x) of the computational basis is 
an eigenstate of Hp with energy = /(x). Using the mapping Xi (1 — az^)/2 in Eq. 
where ai*^ is the Pauli operator acting on the i-th bit, we get the following expression for 
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1), 



the problem Hamiltonian 

N N 

Hp^Ml--Y, B,af + - ^ J,,a»a W , (2) 

where 1 is the identity operator, M is the total number of clauses, is the number of 
clauses in which the bit i participates and Jy is the number of clauses where the bits i and 
j participate together (so Jij = Jji and we set Ju — for convenience). The solution to the 
ECS instance is now given by the ground state of Hp. As for the initial Hamiltonian Hq, a 
standard choice is 

N 

Ho--J2^^\ (3) 

i=l 

where cr^'* is the Pauli operator on the i-th bit. Hq is therefore a 1-local Hamiltonian 
accepting as unique ground state the uniform superposition 

|V^o) = 2-f Yl 1^)- (4) 
xe{o,i}^ 

The adiabatic quantum algorithm consists in preparing l-^o), applying the Hamiltonian Hq 
and slowly modifying the system Hamiltonian, following an interpolation 

H{s{t)) = (1 - s{t))Ho + s{t)Hp, (5) 

where s{t) = t/T and T is the computation time. Let A(s) be the eigenvalue gap between 
the ground state and the first excited state of H{s). If T is large compared to 1/A^, where 
A = min^ A(s), we will obtain a state close to the ground state of Hp at the end of the 
evolution, so that a measurement in the computational basis will yield the solution to the 
problem with high probability. To evaluate the complexity of this algorithm, we therefore 
need to find the minimum gap of H{s). 



III. STATISTICAL PROPERTIES OF RANDOM INSTANCES 

Since we are are interested in random instances of ECS, our results will rely on some 
statistical properties of such instances. Recall that a random instance is obtained by picking 
uniformly and independently M clauses of 3 bits among a set of N bits. Let us study the 
statistical properties of such instances in the hmit of large N, for fixed clauses-to-variables 
ratio a — ^. 
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From Eq. g, we see that an ECS instance over bits is completely specified by the 
N X N matrix {Jij)fj^^ (note that Bi = ^ J2j Jij ^"^^ ^ ~ \ Tli ^i)- Such a matrix defines 
a graph G over vertices such that there is an edge if and only if Jij ^ 0. We will 
now show that for random instances, the local properties of G are independent of A^. 

Let us first focus on the degree of the graph. Since each clause involves two other bits, 
the degree of vertex i is at most twice Bi, the number of clauses involving bit i. Since the 
probability that bit i appears in one random clause is and the M clauses are picked 

uniformly at random and with replacement, Bi follows a binomial distribution 

In the limit N ^ oo, for fixed b and a = ^, we have 

lim Ft[B, = b] = e-3"M!. (7) 

The fact that this distribution converges for large A^, as well as other properties of the 
graph G, will be crucial to our results. In particular, the fact that Bi follows the binomial 
distribution in Eq. ^ immediately implies the following (we denote by {V) and cr^ (V) the 
mean value and variance of a random variable V^). 

Fact 1. For random ECS instances with a = ^ , we have {Bi) = 3a and a"^ [Bi) = 3a(l — -^). 

From Markov's inequality, this implies that Bi, and in turn the degree of each vertex, 
remains bounded with high probability in the limit N oo. 

From Eq. ([7|, we also see that limAr_»oo Pr[-Bi = 0] = e~^", so that a given bit will not 
appear in any clause with probability e~^". This implies that when we generate a random 
instance with A^ bits, only a fraction of the bits will actually play a role in the instance. For 
a given random instance, let A^' be the number of bits present in some clause. From Eq. (|6]), 
we can show that the fraction of present bits N' /N becomes more and more peaked around 
its mean value 1 — e~^" in the limit A^ oo. 

Fact 2. lim^^oo ^ = 1 - e'^" and lim^^o^ = e-3°(l - e-^"). 

For any set of bits S C [A^] , let us define the induced subgraph Gs as the graph on the 
set of vertices S such that (i, j) G 5 x 5 is an edge of Gs if and only if it is also an edge of 
G. When there is no ambiguity, we will sometimes use S to denote the subgraph itself. For 



a given ECS instance on bits, we denote by Qu the set of subsets 5* C [A^] of size u whose 
associated subgraphs are connected, or in short the set of connected graphs of size u. Let 
Cm = I I be the number of connected graphs of size u. We will later use the fact that G„ 
is linear in N (the proof is given in Appendix [A]) . 

Lemma 1. For any m G N, we have {Gy} = Q{N). 



IV. PERTURBATION THEORY FOR THE ADIABATIC HAMILTONIAN 
A. Perturbation theory using Green's functions 

In the following sections, we will show that the Hamiltonian H{s) exhibits an exponen- 
tially small gap close to s = 1. To study the spectrum of H{s) around s = 1, let us consider 
the Hamiltonian H{\) = ^^^^ = Hp + XV, where A = ^ and V = Hq acts as a time in- 
dependent perturbation on Hp. We describe how the spectrum of this Hamiltonian can be 
written as a perturbation expansion in powers of A. Let |x) be a non-degenerate eigenstate 
of Hp with energy Ey_. We define the self-energy as 

oo 

S,,(E) = 5^A^S(«)(E), (8) 

q=l 

where 

Vy^yj = (y*|^|y'')) ^he sum in the last expression is over all eigenstates of Hp different 
from X. The perturbed eigenvalue Ey^{X) is then given by the pole of the Green's function 

Therefore, a perturbation expansion 

oo 

E.(A) = E,, + 5^A^4'^) (11) 

q=l 

may be obtained by solving the equation E = E^ + T,y^{E) recursively using the perturbation 
expansion ([s]) for the self-energy. The energy up to first order is 

E^iX) =E^ + AS«(E,) + 0(A2), (12) 



so that E^^ — Ex\ where, when not exphcitly written, all Ex^(-E') (and later also their 
derivatives) are evaluated E — E^. The energy up to second order term is 

= + AEW + A^ (EpEi^) + Si^)) + 0(a3), (13) 

where we have used Taylor series expansion and kept terms up to second order in A. The 
second order correction E^^ is then given by the coefficient of A^. Similarly, the energy up 
to third order is given by 

i^x(A) = i?x + AE« [E^ + AS« + A^ ip^^'T}^' + E^^^)) 
+A2Ef (£;, + AEi^)) + X'T.^^\E^) + O(A^) 
^ E^ + AE« + A^ (E«'E« + E(2)) 

(^((S«)^)" + (S«E(f))' + E(f)) + O(A^). (14) 

We can now take V = Hq = — and give the first few orders of the expansion. 

First note that in this case t!^\e) = for every odd q. This is because (x|ifo|y) 7^ if and 
only if X and y differ by one bit. Since at least one Ex (or some derivative of it) of odd 
order q' will appear in each term of the correction E^^ for odd order q, all odd orders in the 
perturbation expansion vanish. In this case, the first three non-zero terms are given below. 

= E(^) (15) 

Ei'^ = + (16) 

E^^^ - ^((4^^)T + (4^^4^^)' + 4^^- (17) 

We would like to express each of these corrections as a sum over paths going from the 
assignment x and back. Consider 

Since the only non-zero terms arise when y is a single bit fiip away from x, we can think of 

(2) 

£'x as a sum over all paths going from the assignment x and back which consist in fiipping 
(and fiipping back) only one bit. Similarly, we can think of Ei^'^ as a sum over all paths on 
the hypercube which consist in fiipping any q/2 bits and fiipping them back in all possible 
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sequences. Thus, we define A{P) such that 

J2 A{P), (19) 

where P = {pi)f=i G is a vector whose i-th component specifies half the number of times 
bit i is fiipped (we take half the number since any bit that is fiipped must be flipped back.) 
Of course, specifying P does not uniquely specify a path, so A{P) involves a sum over all 
paths corresponding to P. 



B. Scaling of corrections at successive orders 

In this section, we will show that when evaluating eigenvalues of the Hamiltonian H{X) 
around A = by perturbation theory as described in the previous subsection, corrections 
for successive orders are all of order G(A^). Since all corrections are of the same order, this 
means that for large A^, the leading behavior is given by the first non-zero correction in 
the expansion. This also suggests that the range of A for which the leading order in the 
perturbation expansion gives an accurate approximation is A^-independent (this statement 



will be discussed in Section VII ) 



Let us consider the corrections E^^ in Eq. (19). We denote as S{P) C [A^] the set of bits 



that are fiipped at least once in the paths corresponding to P, i.e., S{P) = {i G [N] : pi > 0}. 
By extension, S{P) then also defines, via the matrix (Jij), a graph where vertices correspond 
to elements of S{P). In order to show that all corrections E^^ scale as 6(A^), we prove that 
we do not need to consider all vectors {P : YliPi ~ but only those associated to 

connected graphs S{P) (we say that the graph is disconnected if S{P) can be expressed as 
a disjoint union 5*1 U 5*2 such that J^j = for all i E Si and j G 5*2). 

Lemma 2. Let Pq G be such that the graph associated to S{Po) is disconnected. Then, 
A{Po) = 0. 

Proof. Let A = (Ai, . . . ,Xn) denote a multi-dimensional perturbation parameter, and let us 
consider the following generalized Hamiltonian 

N N N 

i=l iii=l i=l 
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It can be seen that the perturbation expansion of the eigenvalues of this Hamiltonian can 
be written as 

oo 

KW = Y. E MP)U^?'^ (21) 

with the same coefficients A{P) as for £'x(A). 

Now, let S = S{Pq), and consider the Hamiltonian obtained from the generalized Hamil- 
tonian by substituting Xj = if j ^ 5, 

N N 

H\Xs) = Ml - - ^ i?,o-« + - J2 J^3^^^^^ - E ^^^'^^ (22) 

1=1 id=l i^S 

—* —* 

where is the vector obtained from A by performing this substitution. It is easy to see 
that the perturbation expansion of the eigenvalue corresponding to assignment x is given by 

oo 

^x(A5) = E E ^iP)U^"^ (23) 

9=0 p.rEiPi=<j/2 i 
\ S(P)CS 

again with the same coefficients A{P) as above. Now, observe that the operators a^z^ for 
i ^ S commute with the Hamiltonian H'{Xs). Therefore, the bits outside of S fall out of the 
dynamics, and it suffices to study the Hamiltonian obtained from H'{Xs) by substituting 
az ^ with (— , where Xj is the value of the j-th bit in assignment x, that is, 

H'sixs) = - ^ E b:-^ + z E J^^-^-^' - E >^^-^ (24) 



2 ^ ' ^ '4 



where 



b: = 5.-^E'^^^(-i)'^ (25) 

and we have ignored an irrelevant term proportional to 1. The eigenvalue Eg^{Xs) of this 
Hamiltonian then coincides with that of Hamiltonian H'{Xs), given in Eq. (23), up to this 
irrelevant constant. 

By assumption, the graph associated to S is disconnected, so there exist disjoint non- 
empty sets 5*1 and 5*2 such that S = SiU S2 and Jij = for all i E Si and j E 82- Therefore, 
we can write H'g{Xs) as 

H's{Xs) = H's^{Xs,) + H's^{Xs,). (26) 
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The perturbation expansion of the eigenvalue E'g^ xl-^s*) Hamiltonian H'g^ can be written 
similarly as above 

oo 

^k,x(A5j = E E MP)\{>^r- (27) 

9=0 p.rE,Pi=9/2 i 
S{P)CSfc 

— * — * 

Moreover, the Hamiltonians H'g^{\s^) and H'g^^Xs^) commute since they only act non- 



trivially on different qubits, so by Eq. (26) we have 



EU^s) = E's^^^iXsJ + E's^^^{Xs.J. (28) 



Since there are no terms proportional to Yli -^T expansion (27) for any P = {pi) such 



that S{P) = S, we must have that A{P) = for any such P. □ 

We now show that for connected graphs, the coefficients are finite. 

Lemma 3. Let Pq G such that the graph associated to S{Po) is connected and of size 
u = 0(1). Then, {A{Po)) = 0(1). 

Proof. Let S = S{Pq). Then, for any P such that S{P) C S, the amplitude A{P) in the 
perturbation expansion of the eigenvalue of H is the same as for the Hamiltonian Hg{Xs) in 



Eq. (24). From Jij = 2Bi and the fact that for any i E S, there exists j E S such that 
Jij > 1 (which follows from the fact that S is connected), we also see that ^ < < 3Bi. 
From Fact [l| this implies that {B'^} = 0(1). Since the Hamiltonian H'^ acts on a finite 
number of bits u, the perturbation expansion of its eigenvalues is independent, which 
proves the lemma. □ 

We may now prove the following theorem. 

Theorem 1. For any q = 0(1), the q-th order correction E^^ of an eigenvalue of Hamilto- 
nian H scales as Ie^^^) = Q{N). 



Proof. From Lemma [2| this q-th order correction may be written as 



' \ S{P) connected 



From Lemma [T| the number of terms in this sum is 0(A^) on average, and from Lemma |3| 
each of these terms is 0(1) on average, which implies the theorem. □ 
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V. AVOIDED CROSSING 



A. General idea 



In this section, we will show that for random instances of ECS, perturbation theory 
predicts that the spectrum of the Hamiltonian H{s) will exhibit an avoided crossing, and 
therefore a small eigenvalue gap, close to s = 1. The general strategy will be the following. 
We first consider an instance of ECS with at least two satisfying assignments which are 
isolated i.e., the Hamming distance between the solutions (the number of bits in which the 
two solutions differ) is of order G(A^). Then, we modify the instance by adding a clause 
which is satisfied by one of the solutions, but not by the other, which will now correspond to 
a local minimum of the new cost function. We show that this can create an avoided crossing 
between the levels corresponding to the solution and the local minimum, and therefore a 
small gap in the spectrum of H[s). 



B. Analysis by perturbation theory 



As detailed in Section IV A, perturbation theory allows to evaluate the energy Ey^(X) of 



an eigenstate corresponding to an assignment x as 

oo 
9=1 

For two solutions x-'^,x^, let Ei2{X) = Ei{X) — E2{X) be the splitting between the two 
energies, and e['^ be the g-th order correction to this splitting. We know from Section 



IV 



that the first non-zero correction to E^{X) yields the leading behavior even as increases 



since all corrections scale as G(A^). Moreover, recall from Section IV A that all odd order 
corrections are zero so that the first non-zero correction to E^i^X) appears at second order. 



From Eq. (18), we have 



y 



(x|//o|y)(y|/Jo|x) 

-E'x — Ey 



^ 1 
1 « 



(30) 



(31) 



where we used the fact that E^ = since x is a solution, and that Ey = 5^ if y differs from 
X only in the i-th bit, since assignment y will violate all the clauses where this bit appears. 
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Since this correction does not depend on the particular solution x we start from, we have 
E[f = 0, so the correction to the splitting £'12 (A) between two solutions can only appear at 



order 4. Eq. (16) yields 

N ^ ^ N 



(32) 



where Efj = By for the assignment y obtained from the solution x by flipping bits i and 
i [28j. When bits i and j never appear together in a clause, i.e., Jij = 0, we have Efj = Bi+Bj 



and the corresponding term in Eq. (32) is zero. Therefore, we only need to sum over i,j 
such that Jij 7^ 0, that is. 



Bi + Bj El 



(33) 



so that this expression only involves O(A^) terms, as was shown in Section IV The first 
non-zero correction for the splitting Ei2{\) is then given by 

N / . , \ 2 



p(4) 



so that Ei2{X) = e[^ +0(A^). For random instances, B^, Bj and Ef- will become random 



numbers. Since e[^ is given by a sum of B(A^) random terms with zero mean, we can expect 
the variance of e['^ to be of order O(A^), 

'{E[ir)^C^'^N, (35) 

and similarly for the p-th percentiles, 

P, ((Eit^r) ^ Cl'^N, (36) 

so that Pr[(i?j2^)^ ^ Cp'^^N] ^ 1 — In the next subsection, we will check by numerical 
simulations that this is a good approximation, and also estimate the constant C^^\ but from 
now on assume that this is correct. 

Therefore, for sufficiently large A^, we have that the energy difference l-E'i(A) — E2{X)\ 
becomes larger than 4 with probability ^ — ^'-'^ 

A > Ae = V2 (C^^^iV)-^/^ (37) 
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FIG. 1: Schematical representation of a level crossing, (a) Before adding the clause, we have 
— E2{X*) > 4. (b) By adding a clause satisfied by solution 1 but not solution 2, we create 
a level crossing since E[{0) < £2(0) but E[{X^) > E2{K). 

Suppose that -E'i(A*) — £'2 (A*) > 4 for a given A* > Ac (the situation is depicted in Fig. [T|. If 
we modify the problem by introducing one additional clause, we would still have E[{\^) > 
£^2(A*) since, by definition of Hp, one clause can only increase the energy by 4. However, if 
this clause is such that it is satisfied by but not by x^, we have -Ei(O) = and -^2(0) > ^1 
meaning that we now have a level crossing between A = A* and A = 0. 

If the Hamming distance n between the two solutions scales as the total number of bits 
A^, which is typically the case for instances close to the satisfiabihty threshold [TTl [18] , the 
crossing will only be avoided at the n-th order of perturbation theory, so that the minimum 
gap for the new problem will scale as A", which is exponentially small in A^. A more detailed 



study of the scaling of this gap will be provided in Section VI 



C. Numerical simulations 

To demonstrate the fact that (-£^(2'*)^ scales as 0(A^) and obtain an estimation of the 
slope, we performed the following numerical simulations. Since we are interested in hard 
instances, accepting very few isolated solutions, we fixed the clauses-to-variables ratio to 
a = 0.62, which is close to the satisfiability threshold a.^ [IH]. For each number of bits A^ 
from 15 to 200 by steps of 5, we generated 5000 random instances with M = \_aN\ clauses, 
and then computed the energy splitting E[f between the ground state of H{X) and the level 
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FIG. 2: Statistics of the fourth order correction of the splitting {E^-^)"^. Each data point is obtained 
from 5000 ECS instances with a ~ 0.62. 

that would correspond to the ground state of the Hamiltonian obtained by adding a final 
clause (the details of this procedure are given in Appendix [b|). 

In Fig. ^ we plotted the mean and some percentiles of (£'(2'')^ obtained by our simulation 



as a function of A^. The mean closely agree with the linear regression in Eq. (35), with 
C(^) ^ 0.033. 

Recall that we have shown in Section 



IV 



that corrections E^'^ up to any order involves 



G(A^) terms. Therefore, just as for order 4, higher order corrections to the splitting squared 
{E^^2Y ^Iso expected to scale linearly in A^, so that the 4-th order gives the leading 
behavior of the splitting. To numerically confirm this, we also computed the 6-th order 
correction to the splitting for each instance. In Fig |3| we plotted the mean and some 
percentiles of {E^^2Y- expected, they also agree closely with linear regressions, and in 
particular, we obtain for the mean ^(-£^2^)^^ ~ C'^^-'A^ with C*-^-* ~ 0.44. This also allows us 
to give a very rough first approximation for the range of A where the perturbation expansion 
of the splitting 

ei2(a) = e;ja^+e;ja6+o(a«). 



gives an accurate estimation. Indeed, for the second term to be less than the first term. 



15 



120 








25 50 



75 



100 



125 



150 



175 



200 



N 



FIG. 3: Statistics of the sixth order correction of the spHtting (£'|2 Each data point is obtained 
from 5000 ECS instances with a ~ 0.62. 



we need A < y l-^lg 1/1-^12 I ~ ^r, and using the linear regression for the means yields 



A, ^ (C(^)/C(6))i/4 -0.52. 

VI. SCALING OF THE GAP 

In this section, we will study the scaling of the gap as the size of the problem increases, 
and confirm that the gap decreases exponentially. Recall how that gap is created (see Fig. [T]), 
and consider the modified Hamiltonian with the additional clause, which exhibits an avoided 
crossing for some A = Ac. Let us study what happens when we evolve adiabatically from a 
large A to A = 0. For A > Ac, the ground state corresponds to the energy level E'2{X), so the 
system is in the corresponding eigenstate |x^,A), while the energy level E[{X) corresponds 
to an excited state |x^,A). However, when A < Ac, the ground state now corresponds to 
the energy level E[{X), so this means that the system has to tunnel from |x^. A) to |x^. A). 
For small enough A, these states will be localized, so that the tunneling amplitude will be 
small if x^ and x^ are far apart. More precisely, the minimal gap A, that is, the width of 
the avoided crossing, may be evaluated in the regime of small A by computing the tunneling 
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amplitude 

Al2(A) = (xl|x^A) 

between |x^,A) and |x^), at A = Ac- We will show that this amplitude, and therefore the 
gap A itself, becomes exponentially small if the avoided crossing happens for small enough 
Ac- This implies that unless the evolution is exponentially long, the system will not have 
the time to tunnel from |x^. A) to |x^. A), and therefore end up in the state |x^), which does 
not correspond to a solution but only to a local minimum. 

A. The Disagree problem 

We will show that computing the tunneling amplitude between two solutions of EC3 
reduces to computing the same quantity for an instance of the Disagree problem. An instance 
of Disagree over n bits consists in a set of clauses of the form (xj^ 7^ ^jc)^ where ic,jc £ N- 
As for EC3, a solution of Disagree is a bit string x e {0, 1}" satisfying all clauses, and 
therefore corresponds to the minimum of a cost function 

c 

Therefore, we can design an adiabatic quantum algorithm for Disagree using as final Hamil- 
tonian 

where, similarly to ECS, Jij is the number of clauses where bits i,j appear together, and m 
is the total number of clauses. 

Since each clause involves exactly two bits, an instance may be associated to a graph 
where each vertex i e [n] represents a bit, and each edge G [n]^ represents a clause 
{xi Xj). Note that unless the graph is bipartite, it may include odd cycles and therefore the 
corresponding problem would have no solution. Here, we will focus on instances associated 
to connected bipartite graphs, which admit exactly two solutions, where all bits are set to 
in one partition and to 1 in the other. Note that by negating all the bits in one partition, 
we may map such an instance of Disagree to an instance of Agree, where all clauses are of 
the type {xi^ — Xj^), and where the solutions are the all-0 and all-1 strings. The Agree 
problem has been previously studied in the context of adiabatic quantum computing, and 
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it has been shown that when the graph is a cycle, the gap is only polynomially small, but 
it can be made exponentially small by modifying the weights on the different clauses [TU] . 
Here we show that the Disagree problem is also relevant to the study of ECS. 



B. Reduction to the Disagree problem 

Claim 1. Up to leading order in perturbation theory, the tunneling amplitude between two 
solutions x^, of an instance of ECS over N bits, is the same as for an instance of Disagree 
over n bits where the associated graph is bipartite and n = dni'x.^,^'^) is the Hamming 
distance between the solutions. 

Proof. By perturbation theory, the amplitude of the x^ to x^ transition is given by 

where the sum is over bit strings y* G {0, 1}^. Since x^ is a solution, we have Ey^2 = and 
E^2yi = —Eyi. Moreover, we have \4y = unless rf/^ (x, y) = 1, so each two successive strings 
in a path (x^, y^, y^, . . . , y""^, x^) should only differ in one bit, otherwise the corresponding 
term is zero. Let us assume w.l.o.g. that the bits are labeled so that x\ ^ xj for 1 <i <n, 
and x\ = otherwise. For a given permutation p in the symmetric group S'„, we will denote 
by y*^^'-'^ the string obtained from x^ by flipping bits p(l),p(2), . . . ,p{j). We may now write 
the tunneling amplitude as 

1^121 = yY. p F ^ J, + 0(A"^^). (39) 



Ey{p,l)Ey(p,2) . . . Ey{p,n-1) 



Note that for all the strings in this expression, the bits i/i' = x\ = xf are constant for 
i > n + 1, so these bits are irrelevant and we may consider the restriction of the ECS 
instance where these bits are fixed to the same value as for x^ and x^. In that case, all 
clauses only involving irrelevant bits are trivially satisfied, since they are satisfied for the 
solutions x^,x^, so these clauses are also irrelevant and may be discarded. Starting from 
a satisfying assignment for three bits in an ECS clause, flipping one or three of these bits 
always makes the clause violated, so relevant clauses may only involve two relevant and one 
irrelevant bit. Moreover, the relevant bits have to differ in the original assignment to keep 
the clause satisfied as we flip both of them, so the two relevant bits icjc in a relevant 
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clause are such that xj^ ^ x]^ (similarly for x^). Therefore, discarding irrelevant bits, each 
relevant clause reduces to a Disagree clause (xj^ 7^ ^jc)- Since the instance of Disagree that 
we obtain admits two solutions (the restrictions of x^,x^ to the first n bits), the associated 
graph does not contain any odd cycle and is therefore bipartite. □ 

Since the graph associated with the Disagree instance is bipartite, we may further reduce 
it to an Agree instance (by negating all bits in one of the partitions). Therefore, it suffices 
to study the tunneling amplitude for the Agree problem. 



C. Upper bound for the tunneling amplitude 

Lemma 4. For any connected graph G on n vertices, the tunneling amplitude between the 
all-0 and all-1 solutions of the associated Agree instance is \Ai2\ < ^(2A)" + O^X""^^). 

From there, we can conclude that if an avoided crossing happens at A = Ac the minimal 
gap will scale as 

A^o((^)"). (40) 

for some constant A^ > |, so that it becomes exponentially small if Ac < |. In Appendix [c| 
we also provide an estimation of the tunneling amplitude and show that the estimation 
matches the upper bound up to a constant. In particular, we find A^ ~ 0.81 for a = 0.62. 

Proof. For a given permutation p E Sn, let Ap = 11^=1 (-^ytp-^))^^- need to prove that 
^p65„ — We will actually show that for any tree, J2peSn^p ~ 2""^. Since the 

energies -Ey(pj) can only decrease when removing clauses, the amplitude for a spanning tree 
of a graph is an upper bound on the amplitude for the graph itself, so this implies the lemma. 

We prove that Ylp<^s„ ~ ^""^ for all trees by induction on the size of the tree n. For 
n = 2, the only tree consists in two vertices connected by an edge, corresponding to a unique 
clause (xi = X2). There are only two permutations p G 5*2, and since fiipping either bit will 
violate the clause, we have Ap = 1 for each p and finally A12 = 2. 

Suppose that J2peSn ~ ^""^ trees of size n. We show that this implies that 

Xlpe5 +1 ~ 2" for all trees of size n + 1. The set of trees of size n + 1 may be generated by 
attaching an additional vertex to any possible vertex of any possible tree of size n. Let us now 
consider a particular tree of size n, and suppose we attach an {n + l)-th vertex to the vertex 
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number k. Let us represent a permutation p G Sn as a vector p = . . . ,p{n)). Then 

all permutations p' G Sn+i may be obtained by considering all permutations p E Sn, and 
inserting element (ri + 1) in all possible positions. Let pi G Sn+i be the permutation obtained 
from p E Snhj inserting element {n + 1) after element p{l), that is, pi = (p(l), • • • ,p{l),n + 
l,p{l + 1), . . . ,p{n)), where < I < n (for / = 0, the element is inserted in first position, 
before We then have X]p'es„+i ^p' ~ X]p65„ Ym=o ^pr ^^^^ show that for any 

p G Sn, we have Ya=o ^pi ~ 2^p, whatever the tree of size n we start with and the vertex k 
where we attach the additional vertex. This allows to conclude the proof. 

Let us fix a permutation p & Sn, and write Ej = Ey(p,j), so that Ap = 11^=1 (-^i) "*^- 
need to compute Ym=o ^pi the new Agree instance, obtained by attaching an (n + l)-th 
vertex to the vertex number k of the associated graph, or equivalently introducing a new 
clause {xk = Xn+i)- Let k' = p~^{k). Starting from the all-0 solution, let us study how the 
energy changes as we flip bits in the order specified by pi. For < / < A;' — 1 the additional 
clause will be violated as soon as we flip the new bit n + 1 (in position Z), and until we flip 
bit k (in position k'). Therefore, we have 

I fc'-l n-l 

j=l j'=l j"=k' 

and it is straightforward to check that X]f=o^ ^pi ~ YYjZi{Ej)~^ = Ap. Similarly, for k' < 
I < n we also obtain J2^=k' ^pi ~ ^P' therefore X]r=o ^pi ~ claimed. □ 

VII. ANDERSON LOCALIZATION AND THE APPLICABILITY OF PERTUR- 
BATION THEORY 

As shown in Section |V| the spectrum of H(X) presents avoided crossings for A close to 
zero, and the position of the first avoided crossing Ac is expected to scale as 0{N^^^^). The 
presence of an avoided crossing makes the perturbation expansion divergent for A > Ac- This 
means that, strictly speaking, the convergence radius of the perturbation theory scales as 
A, = 0{N-^/^), which tends to zero as becomes large. 

However, this does not imply that we can not use perturbation theory to estimate the 
spectrum of H{X) beyond Ac. For A > Ac, the perturbation expansion will be asymptotic 
rather than convergent. While asymptotic expansions do not generally converge, they may 
still provide accurate approximations as long as only a finite number of terms are considered. 
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In our case, the divergence is due to an avoided crossing between two levels corresponding 
to assignments at a Hamming distance n — Q{N) from each other. This means that the 
divergence will only appear at order n — 9(iV) in perturbation theory, so that successive 
orders of the perturbation expansion will provide better and better approximations to the 
spectrum of H[X), as long as wc stop at a finite order. On the other hand, for orders higher 
than n, the expansion will start diverging and the approximation will become less and less 
accurate. The following lemma may be used to obtain an upper bound on the error of the 
estimation of an eigenvalue. 

Lemma 5. Let {ip) and E be estimations of an eigenstate and corresponding eigenvalue of 
a Hamiltonian H. Then H admits an eigenvalue E such that \E — E\ < \\{H — 

Proof Let e = - By definition, we have e^ = {ip\{H-Elf\i)). This implies 

that {H — Eiy has an eigenvalue such that < < e^. In turn, H — El must have an 
eigenvalue e with — e < e < e or, equivalently, H has an eigenvalue E = E + e. □ 

If the estimations are obtained from g-th order perturbation theory, this implies that the 
error on the eigenvalue may be bounded by computing the (g + l)-th order corrections to 
the eigenvalue and eigenstate. If the (g + l)-th order corrections are small compared to the 
g-th order estimation, we still obtain a perfectly valid estimation, even if we are outside of 
the convergence radius. 

Of course, when A becomes much larger, perturbation theory will eventually fail as the 
error due to higher orders will not be small compared to the estimation anymore. The success 
of perturbation theory relics on the fact that the perturbed eigenstate is close enough to the 
unperturbed one. In our case, an unperturbed eigenstate corresponds to a basis state |x), and 
the perturbed eigenstate will be close when it has a large overlap on |x), and a smaller overlap 
on other basis states |y), decreasing exponentially with the Hamming distance dij(x, y). 
Such a state is called a localized state. Therefore, the validity of perturbation theory is 
intimately related to the phenomenon of Anderson localization. 

Note that the Hamiltonian H{X) = Hp + XHq may be written as 

H{x) = ^x|x)(x|-A Yl (41) 

xe{0,l}JV x,y:djf(x,y)=l 

where the second term (kinetic energy) describes a particle hopping on the vertices of a hy- 
percube, and the first term (potential) adds random energies to each vertex of the hypercube 
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(here the randomness comes from the distribution of random instances). This expression 
is therefore very similar to the Hamihonian used by Anderson to demonstrate his famous 
locahzation effect in disordered systems [12], the main difference being that the geometry 
of the system is the A^-dimensional hypercube instead of the (i-dimensional lattice. The key 
feature of Anderson localization is that for large enough disorder, or in our terms for small 
enough A, the eigenstates of H{X) are localized (i.e., the particle is bound to a vertex of the 
hypercube) so that perturbation theory is applicable in this regime. On the other hand, for 
large A, the first term (corresponding to the disorder) is negligible and the eigenstates of 
if(A) are close to the eigenstates of Hq, describing waves travelling through the hypercube. 
Therefore, for large A, the eigenstates of i^(A) will have amplitude over all basis states |x) 
and be called extended. 

The transition from localized to extended states marks the failure of perturbation theory, 
and the position A^ of this transition may be viewed as a weak notion of convergence radius, 
as shown by Anderson. To circumvent the divergence of perturbation theory due to avoided 
crossings close to A = 0, Anderson proposed the following resolution. He added a small 
imaginary part irj to all energies i?x of the unperturbed Hamiltonian. After this, perturbation 
theory provides a convergent series, which Anderson called locator expansion. Then, he took 
the limit 77 — 0, but only after taking the limit N —* 00. Recall that the original divergence 
of the perturbation expansion started at order n = Q{N). By taking N ^ 00 before 1] 0, 
this keeps the series convergent and the radius of convergence A^ then corresponds to the 
transition from localized to extended states (note that if we take the limits in the other 
order, we would obtain the original radius of convergence Ac due to avoided crossings, which 
goes to zero as 0{N^^^^)). 

For conventional Anderson localization, the contribution if] to the energies can be due 
to a weak coupling to a thermal bath, or to a possible escape of the particle through the 
boundary of the system. In the case of adiabatic quantum optimization, the basic idea is 
that ih/T (where T is the total computation time) would play the same role as ir]. Indeed 
let A be the eigenvalue gap created by an avoided crossing. During an evolution of time T, 
the system cannot resolve energies smaller than h/T. If A ^ fi/T, the system will therefore 
behave as if the avoided crossing was an actual crossing between two decoupled levels, and 
the divergence of the perturbation expansion due to this avoided crossing will disappear. 
We have shown in the previous section that A is exponentially small, which implies that 
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this argument is consistent unless we consider an exponentially long time T > h/A. While 
a complete analysis of Anderson localization in adiabatic quantum optimization could be 
the subject of an article on its own, the only property that we need is that the position 
of the transition decreases slower than the position of the first avoided crossing Ac- For 
Anderson localization on a d-dimensional lattice with random energies of order E, Xj. is 
believed to scale as Q{E/ {d\nd)) [20l [21]. In our case, both the degree of vertices and 
the random energies scale as Q{N), so we can expect by analogy that A^ = Q{l/lnN), 
which is sufficient for our result since Ac = 0{N^^^^). This is probably pessimistic since the 
conventional Anderson localization considers the critical A for which all eigenstates are still 
localized, while we are only interested in low energy eigenstates which remain localized much 
longer (as the density of eigenvalues is much larger in the middle of the spectrum, this is 



where the first extended states appear). Moreover, since we have shown in Section IV that all 
terms in the perturbation expansion scale as Q{N), meaning that dividing the expansion by 
N gives an expression which is essentially A^-independent, it is actually likely that A,. = 0(1) 
for low energy states. 

VIII. CONCLUSION AND DISCUSSION 

Putting everything together, we conclude that the adiabatic quantum optimization algo- 
rithm fails for random instances of ECS because of the presence of an exponentially small 
gap close to the end of the evolution (at s = 1 or A = 0). This result relies on different 
elements. In Section [TV] we have shown that all corrections in the perturbation expansion of 
eigenenergies of H(\) scale as G(A^). In Section |V| we have shown that perturbation theory 
predicts an avoided crossing to occur for some Ac = 0{N^^^^), and confirmed this scaling by 



numerical simulations. In Section VII , we have also argued that by analogy to the usual An- 
derson model, we can expect states to be localized for small enough A < Ar = Q{l/\ogN), 
so that perturbation theory is applicable in this regime. Since, as A^ increases, Ac decreases 
faster than A,., for large enough A^ the expected avoided crossing occurs for Ac < A^, where 
the perturbation expansion is accurate, so that the prediction of perturbation theory be- 



comes valid. Finally, in Section VI, we have shown that the gap induced by an avoided 
crossing at Ac will scale as A = 0{Xc/Xa)"', where Xa is a constant and n is the Hamming 
distance between the assignments corresponding to the two levels. For large instances close 
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to the satisfiability threshold random instances will only have a few isolated solutions 
which are essentially independent from each other, so that n = Q{N). Moreover, since 
Ac = 0{N^^^^), this implies that the gap will scale as A = 2^'^^^^°^^\ and therefore the 
algorithm will fail unless it takes (super-)exponential time. It is interesting to note that 
this scaling matches a lower bound proved by van Dam and Vazirani (see |12j, the proof 



is given in Appendix D for completeness), so that the typical gap for random instances is 
really as small as it can get. The fact that the gap is actually super-exponentially small also 
implies that the complexity of the AQO algorithm will be even worse than classical solvers 
(including the naive algorithm that exhaustively checks all possible assignments). 

While our results predict an exponentially small gap to occur closer and closer to the end 
of the evolution (at s = 1), an obvious question is why this was not observed in previous 
numerical simulations of the gap, where it seemed to scale polynomially and occur close to 
a fixed s, at least for small (up to A^ < 20 in [3 [22], 60 in [23] and 128 in [21], each using 
different methods). As for the position of the gap, note that it scales as Ac = 0{N~'^^^), 
so the dependence on A^ is weak and we need to consider a very broad range of A^ to 
observe it. More importantly, A^ must also be large enough so that the avoided crossing 
happens in a region where perturbation theory is applicable, i.e., Ac < A^, and where the 
tunneling amplitude decreases exponentially, i.e., Ac < \a- Since we have found the condition 
Ac > -\/2 (C^'^^A^)"^/^ for an avoided crossing to occur at Ac, and obtained the estimations 
Xr ^ 0.52 and A^ ^ 0.81, this implies the bound A^ > ^^\^^s ~ 86000 bits to observe 
the exponentially small gap predicted by our approach. While this number is very high, 
practically the exponentially small gap could appear much faster. A first reason is that in 
many cases, an additional clause will only increase the energy by 1, and not the maximum 
4, due to the form of the cost function in Eq. ([T]). If we only impose that Ei2{\) > 1, we 
see that small gaps already occur with high probability for A^ > -^tjt)^ ~ 5400. A second 
reason is that in our numerical simulations, we only considered avoided crossings created 
by the addition of the last clause, but any of the M — 1 other clauses could induce other 
avoided crossings, so that it could be possible to observe this exponentially small gap as 
soon as A^ is of the order of a few hundred bits. In particular, let us note that very recent 
numerical simulations up to A^ = 256 by Young et al. have revealed that more and more 
instances exhibit a first order phase transition and therefore a very small gap [25J, which 
could be due to the mechanism described in the present paper. In Figure |4| we plotted a 
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FIG. 4: Simulation of a level crossing for a random instance with N = 200 bits and a « 0.62, 
obtained by fourth order perturbation theory. Inset: To make the crossing more apparent, we 
plotted the energy differences Ei — E2 and E2 — Ei. 

level crossing predicted by fourth order perturbation theory for an instance with = 200 
obtained during our numerical simulations. The crossing occurs at A ~ 0.51 (possibly inside 
the region where perturbation theory is valid), and the corresponding assignments are at 
distance n = 60 from each other. 

While our argument immediately implies that a particular quantum adiabatic algorithm 
for ECS will fail, it is important to note that it also applies to more general cases. In 
particular, it does not rely on the specific form of the problem Hamiltonian Hp, but rather on 
its general statistical properties, so that it should extend to other NP-complete problems such 
as 3-SAT. Moreover, our argument does not rely on the precise form of the initial Hamiltonian 
Hq either, or even on the possible path between the initial and the final Hamiltonian, but 
only on the behavior of the perturbation in the vicinity of the final Hamiltonian Hp, and 
we would actually obtain similar conclusions for any perturbation acting locally on the 
qubits. Therefore, any adiabatic quantum algorithm aimed at solving a similar NP-complete 
optimization problem by mapping its cost function to a final Hamiltonian would eventually 
fail for the same reason. Actually, for other problems the situation could be even worse, in 
that the small gap could occur for smaller instances than for ECS. Indeed, while for ECS the 
first non-zero correction to the splitting Ei2{X) between two solutions only appears at order 
4 of perturbation theory, for many other problems, such as S-SAT, a similar Hamiltonian 
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would already exhibit a splitting at order 2, so that the position of the avoided crossing 
would scale as Ac = 0{N''^/^), and therefore this avoided crossing would happen inside the 
region where perturbation theory is valid for even smaller instances. 
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APPENDIX A: PROOF OF LEMMA [l] 

The proof of the lemma relies on the following claim. 

Claim 2. For any u E N, there exists g^yQu ^ ^ ^'^^ E N such that %^ < 

u 

Pr[ '\u] is connected"] < for any N > N^. 

We first show that this implies Lemma [TJ and then prove Claim [2] 

Proof of Lemma\l\ By definition, we have Gu = J2sc[n]-\s\=u -^["'S' is connected"], where I[£] 
is the indicator variable of event S. Since all subsets of size u are equivalent up to some 
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permutation of the bits, we have 

= J2 is connected"] = ( ^ J Fi[''[u] is connected"], 

which, together with Claim [2| imphes that (Gu) = Q{N). □ 

Proof of Claim^ Let us start with the lower bound. For subsets 81,82,8?^ C [A^], we say 
that a clause (xj^, Xjj, Xjg) is of type {81,82,8^) if ik G 8k, for k = 1,2,3. Let 

u 

" N{N-1) 

be the probability that a given random clause is of type {[u], {u + 1}, [A^]), and let us denote 
by Conn{8) the event "S* is connected". A sufficient condition for [u + 1] to be connected 
is that for each q G [u], there is exactly one clause of type ([g], {q + 1}, [A^]). Therefore, we 
have 

A/f' " 

PT[Conn{[u + 1])] > 7^^^— -y(nP.)(l - E^^)'""" (Al) 

^ q=l q=l 

^ MM f {M-u)u{u + l) \ 

- {M - u)\N^{N - 1)^ \ 2{N-1) J ^ ' 

> ^> (A3) 

where we have assumed that A^ > au{u + 1) + 1 so that the second factor is larger than 1/2, 
and defined Cu+i as 

f N \ MM 



u + l) 2(M - u)\N^+^{N - ly 
Note that C2 = a/4 and 

^ ^ Cuu{N - u){M - u + I) ^ C> 
{u + l)N{N-l) - 8 ' 

for A^ > 2m — 1 and M > 2u — 2. Therefore, the lower bound holds with g2 = a/4, 
9u+i = 9u^l^ A^ sufficiently large. 

As for the upper bound, we prove it by induction on u. For m = 1, we have 
Pr["[l] is connected"] < 1 so the claim holds with = 1. Let us denote by "(7/(5*1, 'S'2, 5*3)" 
the event that there exists at least one clause of type {81, 82, 83) within the M clauses. Now 
assume that Pr [Conn ([«])] < for some constant gl^. We need to bound the probability 

\u) 

that [u + 1] is connected. If [u + 1] is connected, there exists some vertex z G [m + 1] such 
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that the graph obtained from [u + 1] by removing vertex i, i.e., [m + 1] \ {i}, is connected as 
well. Therefore 

PT[Conn{[u + l])] 

<6 J2 PAConn{[u + l]\{t})ACl{{t},[u + l]\{i},[N])] 

ie[u+i] 

= Q{u + 1) FT[Conn{[u]) A Cl{{u + 1}, [u], [N])] 

< 6{u + 1) FT[Conn{[u]) A Cl{{u + 1}, [u], [N] \ [u])] 

+6{u + 1) FT[Conn{[u]) A Cl{{u + 1}, [u], [u])]. (A4) 
For the first term, we have 

PT[Conn{[u]) A Cl{{u + 1}, [u], [N] \ [u])] 

= Pt[CI{{u + 1}, M, [N] \ [u])] ■ PT[Conn{[u]) \ Cl{{u + 1}, [u], [N] \ [u])] 

< Ft[CI{{u + 1}, [u], [N] \ [u])] ■ FT[Conn{[u])] 

< M ^(^~ ^ ~^) . Pr[Conn([u])] 
- N{N -1){N -2) ^ 

<^^.Pr[Conn(M)]. (A5) 



Similarly, for the second term of Eq. (A4), we have 



Fi[Conn{[u]) A Cl{{u + 1}, [u], [u])] 



< 



Fi[Cl{{u + 1}, [m], [u])] ■ Fi[Conni[u]) \ Cli{u + 1}, [u], [u])] 

au{u - 1) Ft[CI{[N], [u], [u]) \ Conn{[u])] ■ FT[Conn{[u])] 
{N-l){N-2) Ft[CI{[N],[u],[u])] 
au{u — 1) FT[Conn{[u])] 
= {N-l){N-2)Fi[Cl{[N],[u],[u])y 

We need to upper bound Pr[C/([A^], [u], [u])]. Let 

u{u — 1) 



(A6) 



N{N-1) 

be the probability that a given random clause is of type ([A^], [u], [u]). For > 2au{u—l) + l, 
we obtain 

^ r^,/r,xn r i r ,r / ,^ N au(u—l) au(u—l) 

FT[Cim, H, [u])] > Mg.(l - Mg„) > ^^^^ > ^^^^ _ 
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Therefore, the second term in Eq. (A4) satisfies 



PT[Conn{[u]) A Cl{{u + 1}, [u], [u])] < 



4 



■ Pr[Conn([M])]. 



N-1 



Putting everything together, Eq. (A4) imphes 



PT[Conn{[u+l])] < 



< 



6{u + l) 
N-1 

6{u + l) 
N-1 



{au + 4) Fi[Conn{[u])] 



{au + 4) 




\u+l/ 



(AT) 



so that the upper bound holds with = 1, g^^i = 6g^{au + 4:), and sufficiently large. □ 

APPENDIX B: DETAILS OF THE NUMERICAL SIMULATIONS 

For each number of bits A^ from 15 to 200 by steps of 5, we generated 5000 random 
instances with M = \_aN\ clauses using the following procedure. 

We first picked M — 1 triplets of bits (the clauses) uniformly at random and with replace- 
ment, and cleaned the corresponding ECS instance by removing absent bits, and clauses 
involving two bits appearing in no other clause, which introduce trivial degeneracies in the 
space of solutions. We then solved the cleaned ECS instance P using the Davis-Putnam- 
Logemann-Loveland algorithm [26l |27] . to find all of its satisfying assignments (note that 
this step takes exponential time and therefore prevents us from considering too large in- 
stances). We added a final random clause to create a new instance P', and checked which 
one of the solutions of P satisfied the new clause, therefore being also solutions of P'. For 
each of the solutions of P', we computed the fourth order correction to the energy 
(under Hamiltonian H'{X)), and identified the solution having the minimum correction, 
which corresponds to the ground state of H'{X) for A > 0. We then computed the correction 
under H{X), not only for but also for all solutions of P not being solutions of P', 
in order to identify the solution x^ having the minimum correction. Finally, we calculated 
the 4-th order correction e[^ of the splitting between energies of assignments x^ and x^ 
for the instance without the added clause P. Indeed, if this splitting is large enough, this 
creates an avoided crossing between the ground state |x^. A) and the excited state |x^. A) of 
the Hamiltonian H'{X) for the instance with the added clause. 



30 



Note that if any of the previous steps failed for some reason (e.g. P has no solution, or all 
solutions of P are also solutions of P'), we rejected the instance and started over with M — 1 
new random clauses. Nevertheless, after having generated 5000 valid instances, we checked 
that the number of rejected instances stayed approximately constant as N increases, so that 
our conclusions are verified with constant probability over the random instances. 

The numerical simulations were run using a parallel algorithm on 2 nodes of the NECLA 
cloud, where each node has access to two dual-core Intel Xeon 5160 (3GHz) CPUs and 
3.5GB of RAM. The full set of simulations took between 3 and 4 days to complete. 



APPENDIX C: ESTIMATION OF THE TUNNELING AMPLITUDE 

The case of a tree provides an upper bound on the tunneling amplitude between two 
solutions of an Agree instance over n bits, and as a consequence an upper bound on the 
tunneling amplitude between two solutions of an EC3 instance with Hamming distance n. In 
this appendix, we will derive an estimation (rather than an upper bound) of this tunneling 
amplitude. Let us consider an arbitrary Agree instance over n bits with m clauses. Let Bi be 
the number of clauses where bit i appears, and Jij be the number of clauses where bit i and 
j appear together (there is actually only one such clause for the Agree problem, {xi = Xj), 



but in general this clause could be repeated). Following the notations of Section VIC 
let Ej = Ey(p,j) be the energy of the bit string y'-^'-'^ obtained from by flipping bits 
p(l),p(2), . . . ,p{j) for a given permutation p G Sn, and let Ap = 11^=1 (-^i) "*^- We need to 
estimate 

1^121 = A" ^ + 0(A"+i) 

= A"n!(A,)^ + 0(A"+i), (CI) 
where the average is taken over permutations p E Sn- Let us consider the approximation 

/n-l ' 
n-l 

- n (^^■);' • (C2) 

We need to compute the average energy Ej of a bit string obtained by flipping j bits from 
a solution. When a particular bit is flipped, each clause where the bit appears will become 
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violated, but it will be satisfied again when the second bit in the clause is flipped, so that 

k=l \ 1=1 / 

Since, by deflnition, = 2m and Yl]=i'^ij = ^i, we have <^-Bp(fe)^ = ^ and 



Jp{k)p(i))^ = ;^(^ (for any kj E [n]), and inserting in Eq. (|C3|) yields 



p 



n{n — 1) 

This shows that the average energy barrier that has to be broken to tunnel from one solution 
of Agree (and by reduction of ECS) to another is shaped as a parabola, see Fig. |5} We may 
now estimate the tunneling amplitude 

n-l 



A"n / n 



n— 1 

(C4) 



(n-l)! V 2/? 

where we have deflned the clauses-to-variables ratio P = ^ for the Agree instance. Using 
the Stirling formula n\ ^ n"e~", we obtain for large n 

2(3n fe\ 

which is therefore exponentially small in n as soon as A < ~ 

Since we are more interested in ECS than Agree, we need to estimate the typical value of 
(3 for the Agree instances obtained by reduction from ECS instances. Let us consider random 
instances of ECS with bits and M = aN clauses. We have seen in Section IIIII that the 
clauses forming such a random instance will typically only involve about N'^^^ = N{1 — e~^") 
of the A^ bits. Let us discard the absent bits and focus on assignments of the remaining 
bits satisfying all clauses, that is, the solutions of the instance. Since, for each clause of S 
bits, one of the bits has to be set to 1 and the other two to 0, we expect that solutions will 

A^' 2N' 

typically involve about —j^ I's and — ^ O's. Therefore, if we look at a random bit Xi in a 
solution, we would expect that Prj[a;j = 1] = |. Let us now consider the joint distribution 
Pbife2 = PTi[{xj,xf) = (6^,6^)] for two solutions x^,x^. For instances with large A^ and a 
close to the satisflablity threshold, the solutions will be essentially independent, so we can 
expect pfeife2 = FTi[x} = b^] ■ Pr^x.^ = b"^], that is, ]3oo = |, Poi = Pio = § and Pii = I- From 
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FIG. 5: Average energy barrier for random paths between the two solutions of an instance of Agree 
with n = 20 bits and m = 20 clauses. {Ej)^ gives the average energy after flipping j bits from one 
of the solutions. 



there, we can estimate the typical Hamming distance between two solutions of a random 
ECS instance, rityp = A^typ(Poi + Pio) = 

Out of the M clauses, we now need to estimate the number of relevant clauses, that is 
those involving bits that differ in the two solutions, since these are the clauses corresponding 
to agree clauses in the Agree instance obtained by reduction. Let us consider the set of all 
clauses satisfied by the two solutions, and assume that we sample M clauses uniformly at 
random from this set. We say that a bit Xi is of type b^b^ when {x},xf) = {b^,b'^). A 
relevant clause involves one 00-type bit, one 01-type bit and one 10-type bit. Therefore, 
there are typically about Mrei = 6pooPoiPio(A'typ)'^ possible relevant clauses. Similarly, an 
irrelevant clause involves one 11-type bit and two 00-type bits, so there are typically about 
Mirr = 3]9ii]9Qo(A^t'yp)'^ irrelevant clauses. A random clause satisfied by the two solutions 
will therefore be relevant with probability ^^^m. — \- As a consequence, when picking 
M random clauses, there will typically be around ^typ = ^ relevant clauses. Finally, 
the typical clauses-to-variables ratio for the Agree instance obtained by reduction may be 



estimated as /5typ 



ntyp 



2 



-j-. Back to our original concern about the tunneling 
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amplitude, we see that it will be exponentially small for A < Xa, where 

e - 1 • 

For a = 0.62 (the value we used in our numerical simulations), we obtain Xa ~ 0.81. 



APPENDIX D: LOWER BOUND ON THE MINIMUM GAP 

Lemma 6 ([12j). Let H{s) = (1 — s)Hq + sHp be a Hamiltonian on N quhits where Hq = 
— ^fLiO'x\ and Hp = ^^g|Q ^|jv -Ex|x)(x| with E^o = for some x° G {0,1}^ and 1 < 
E:x_ < M = 0{N) for all x 7^ x°. Then, the eigenvalue gap of H{s) satisfies A(s) > 
2'0(NiogN) for alio < s <1. 

Proof. Let us first consider the case s > 2n+2 - ^o^^ that the diagonal elements of H(s) 
are given by ifxx = sE^^, and the non-diagonal elements satisfy Xly^x-^xy = ~(1 ~ s)N. 
Therefore, all Gershgorin circles have radius (1 — s)N and the circle around Hy-o^o = is 
disjoint from the circles around i/xx > s (for x 7^ x*^) as soon as (1 — s)N < s — (1 — s)N, 
that is, s > 2^+2 - ^^sX case, the smallest eigenvalue lies in the circle around H^Oyfi = 0, 
and all others in the union of the other circles, so that we have for the eigenvalue gap 
A{s) = E,is) - Eois) > s - 2(1 - s)iV > ^ > fi(^) for s > fg. 

Let us now consider the case s < 2n+2 - Q ~ -h{s^-+sMi _ ^ _|_ \(^]\/[-]_ _ Hp), where 
A = —Hq is the adjacency matrix of the hypercube and A = jf^. Note that since Ey^ < M, 
all elements of this matrix are non-negative. By the mixing properties of the hypercube, 
all elements of A'^ are at least 1, and therefore this is also true for Q'^ . This implies 
that the gap between the largest and second largest eigenvalue of is at least 1, that is, 
f^o ~ 1^1 — 1) where /xq and /ii are the largest and second largest eigenvalue of Q. The 
eigenvalues /Xj of Q are upper-bounded by the spectral radius p{Q), which in term is upper 
bounded by the norm | IQI |i = maxx J2y I'^xyl, so that /ij < AM + for all i. Moreover, we 
have fiQ — Hi < (/xq — + so that 

1 1 1 

^° " - (/io + ~ (2AM + 2iV)^-i - (4MAr + 2M + 2Ar)^-i ' 



where we used the fact that A < 2N + 1 when s < 2n+2 ' Filially^ "we have 

1 

{2N + 2){AMN + 2M + 2N) 



□ 
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